CDS
Accession Number | TCMCG019C27299 |
gbkey | CDS |
Protein Id | XP_022957703.1 |
Location | complement(join(3699520..3699684,3700795..3700917,3700996..3701110,3701433..3701531,3701615..3701667,3701782..3701938,3702501..3702559,3702694..3702761,3702839..3702982,3703386..3703449,3704227..3704277,3704356..3704495,3704669..3704808,3704985..3705070,3705148..3705293,3705405..3705477,3706801..3706876,3707600..3707707,3707878..3707924,3708561..3708606,3708732..3708844,3708968..3709003,3709110..3709313,3709895..3709960)) |
Gene | LOC111459167 |
GeneID | 111459167 |
Organism | Cucurbita moschata |
Protein
Length | 792aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA418582 |
db_source | XM_023101935.1 |
Definition | DNA mismatch repair protein MSH4 [Cucurbita moschata] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAAGACGACGGAGGCGAGAGATCGAGCTACGTGATCGGTCTGATCGAGAACAGAGCTAAGGAGGTTGGAGTTGCTGCGTTCGATTTGAGATCAGCTTCACTTCATCTTTCTCAATATATAGAGACCAGCAGCTCCTACCAGAATACAAAAACTTTGCTGCATTTCTATGATCCAATGGTGATTCTAGTTCCTCCCAACAAGCTCGCACCTGATGGCATGGTTGGAGTTTCTGTTTTGGTAGATAAATTTTATGTTACAGTGAAGAAGGTTGTAATGGCTCGTGGTTGCTTTGACGACACAAAGGGTGCTGTTTTGATTAAGAATCTGGCAGCCAAGGAGCCTTCTGCTCTTGGTTTGGAAACTTATTACAAACAGTACTATCTCTGCTTGGCTGCTGCTGCTGCTAGCATTAAATGGATAGAAGCAGAGAAGGGGGTTATTGTGACCAATCACTCTTTAACGGTCACATTTAATGGTTCATCTGATCATGTGAGCATTGATGCAACGAGTGTTCAGAATTTAGAAATTATTGAGCCACTTCACTCCAACCTTTGGGGAACAAGCAACAAGAAGAGAAGTCTGTTCCACATGCTCAAGACAACTAAAACTATAGGAGGGTCTAGACTTCTTCGTGCCAATCTTTTGCAGCCATTAAAAGATATTGAAACCATTAATGCCCGTCTGGATTGCCTGGATGAACTGATGAGCAATGAACAACTGTTCTTTGGGCTCTCTCAAGCTCTCCGTAAATTTCCTAAAGAGACTGACAGAGTACTTTGCCACTTCTGCTTCAAGCAAAAGAAAGTTACCAATGAAGTTTTGGGTGCTGATAATGCTAAAAAGAGCCAAAGTTTAATATCTAGCATTATTCTGCTGAAAACTTCTCTCGAGGCATTGCCTTTACTTTCAAAGGTGCTTAAAGAAGCAAAGAATTTTCTTCTTGCAAACATCTACAATTCTGTTTGTGAAAATGAAAAATTTGCAACCATTAGAAGGAGGATTGGGGAGGTCATCGATGAGGATGTTCTTCATGCTAGGGTTCCTTTTATTGCCCGCACTCAGCAGTGTTTTGCGGTCAAGGCTGGAATTGATGGACTGCTTGATATCGCTAGAAGGACATTTTGTGATACTAGTGAAGCAATACATAATCTTGCTAATAAATACCGAGAGGAGTACAAGCTGCCCAATTTAAAACTGCCATTTAACAATAGACAAGGGTTTTACTTGAGCATTCCTCGGAAAGATGTACAAGGCAAGCTTCCTAGCAAGTTTATTCAGGTCTTGAAGCATGGGAACAATATACGATTCTCTACTCTGGAACTTGCTTCTCTGAATGTTAGAAACAAGTCTGCAGCTGGAGAATGCTATATACGAACAGAAATTTGCCTGGAAGGACTGGTAGATGCCATAAGAGAGGACGTCTCTATGCTCACACTGCTTGCAGAAGTCTTGTGTCTCTTAGATATGATGGTTAATTCTTTTGCACATACAATATCTTCGAAGCCTGTGGATAGATATACTAGGCCAAATTTTACAGAAAGTGGCCCGATGGCAATTGAAGCTGCGAGACACCCAATCCTAGAAAGTATACACAACGATTTTGTTGCTAACAGTATATTTCTATCGGAAGCATCAAACATGATAATTGTCATGGGCCCAAATATGAGTGGAAAGAGTACCTACCTTCAACAAATGTGCCTTCTAGTTATTCTTGCTCAGATTGGATGTTATGTTCCAGCACAATTCTCAACCTTGAGGGTTGTTGATCGTATATTCACAAGAATGGGCACAGAAGATAGTCTAGAGTCCAACTCCAGCACATTCATGACAGAGATGAAGGAAACAGCTTTTGTGATGCAGAATGTCTCCCATAGGAGTCTCGTTGTCGTGGATGAACTTGGGAGGGCAACATCTTCTTCCGATGGATTTGCAATTGCATGGAGCTGCTGCGAATATCTTTTATCACTGAAAGCCTATACCATATTTTCCACTCATATGGACGGCCTATCAGAACTAGTAACCATCTATCCAAACGTAAAAGTTCTTCACTTCCATGTTGATATAAGGAATAACCGTTTGGATTTCAAGTTTCAACTAAAGGATGGAATTAGACATGTACCACACTATGGCCTTCTATTAGCAGAAGTGGCAGGACTGCCAAGCTCAGTTATTGAAACTGCAAGAAACATTACTTCCAGGATCTTGGAAAAGGAAGAAAGACGGATGGAGATAAACTACTTGCAGTACCATCCTATTAGAATGGCTTATAATGTAGCTCAGCGGTTGATTTGTTTGAAACATTCCAGTCATGATGAGGATTCAATCCGAGAAGCATTACAAAATCTTAAAGAAGGGTACATAAATGGGAGGCTGTGA |
Protein: MEDDGGERSSYVIGLIENRAKEVGVAAFDLRSASLHLSQYIETSSSYQNTKTLLHFYDPMVILVPPNKLAPDGMVGVSVLVDKFYVTVKKVVMARGCFDDTKGAVLIKNLAAKEPSALGLETYYKQYYLCLAAAAASIKWIEAEKGVIVTNHSLTVTFNGSSDHVSIDATSVQNLEIIEPLHSNLWGTSNKKRSLFHMLKTTKTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQLFFGLSQALRKFPKETDRVLCHFCFKQKKVTNEVLGADNAKKSQSLISSIILLKTSLEALPLLSKVLKEAKNFLLANIYNSVCENEKFATIRRRIGEVIDEDVLHARVPFIARTQQCFAVKAGIDGLLDIARRTFCDTSEAIHNLANKYREEYKLPNLKLPFNNRQGFYLSIPRKDVQGKLPSKFIQVLKHGNNIRFSTLELASLNVRNKSAAGECYIRTEICLEGLVDAIREDVSMLTLLAEVLCLLDMMVNSFAHTISSKPVDRYTRPNFTESGPMAIEAARHPILESIHNDFVANSIFLSEASNMIIVMGPNMSGKSTYLQQMCLLVILAQIGCYVPAQFSTLRVVDRIFTRMGTEDSLESNSSTFMTEMKETAFVMQNVSHRSLVVVDELGRATSSSDGFAIAWSCCEYLLSLKAYTIFSTHMDGLSELVTIYPNVKVLHFHVDIRNNRLDFKFQLKDGIRHVPHYGLLLAEVAGLPSSVIETARNITSRILEKEERRMEINYLQYHPIRMAYNVAQRLICLKHSSHDEDSIREALQNLKEGYINGRL |